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CATGGTAGACGGCTGCCCGGAGGGACCACGCGTCTGAGACCGGCGATCGGACCGCCAAAACCATGGGTAGCAATCGGGGCCGCAAG 
GCCGGAGGGGGCTCTCAGGACTTCGGCGCGGGACTCAAGTACAACTCCCGGCTAGAGAACATGAATGGCTTTGAGGAGGGTGTGGA 
GTTCCTGCCTGCGAACAATGCCAAGAAAGTGGAGAAGCGAGGCCCCAGGCGCTGGGTGGTGCTGGTGGCAGTGCTGTTCAGCTTCC 
TCTTGCTCTCCCTCATGGCTGGCTTGCTGGTGTGGCACTTCCATTATCGGAATGTGCGGGTTCAAAAAGTCTTCAATGGCCATCTG 
# AGGATCACAAATGAGATCTTTCTGGATGCGTATGAGAACTCCACCTCCACAGAGTTTATCAGCCTGGCCAGCCAGGTGAAGGAGGC 
*GCTGAAGCTGCTGTACAATGAAGTCCCTGTCCTGGGTCCCTACCACAAGAAGTCGGCTGTAACTGCCTTCAGTGAGGGCAGTGTCA 
TCGCCTACTACTGGTCAGAGTTCAGCATCCCCCCACACCTGGCAGAAGAGGTTGATCGCGCCATGGCTGTGGAGCGAGTTGTAACA 
TTGCCACCCCGAGCACGGGCACTGAAATCCTTCGTGCTAACATCTGTGGTGGCCTTCCCCATTGACCCCAGAATGCTGCAGAGGAC 
TCAGGACAACAGCTGCAGTTTTGCCCTGCATGCCCATGGTGCAGCAGTGACACGCTTCACTACCCCTGGCTTCCCCAACAGTCCCT 
ACCCGGCGCATGCCCGCTGCCAGTGGGTCCTGCGGGGGGACGCCGACTCTGTGCTGAGCCTCACCTTCCGAAGCTTTGATGTCGCT 
CCCTGTGATGAGCATGGCAGTGACCTGGTCACCGTGTATGATAGCCTGAGCCCCATGGAACCCCACGCTGTGGTGCGGCTGTGTGG 
CACCTTCTCACCCTCCTACAACCTGACTTTCCTCTCCTCCCAGAACGTCTTCCTTGTCACGCTGATAACCAATACTGACCGGCGAC 
ATCGTGGCTTTGAGGCCACTTTCTTCCAGCTGCCCAAGATGAGCAGCTGTGGCGGCTTTTTGAGTGACACCCAAGGGACATTTAGC 
AGCCCCTACTATCCAGGCCACTACCCGCCCAACATCAACTGCACATGGAATATCAAGGTGCCCAACAACCGGAACGTGAAGGTGCG 
CTTCAAACTCTTCTATCTGGTGGACCCCAACGTACCAGTGGGCTCCTGCACCAAGGACTATGTGGAGATCAACGGGGAGAAGTACT 
GCGGTGAGAGGTCCCAGTTTGTGGTGAGCAGCAACAGCAGCAAGATTACAGTCCACTTCCATTCTGATCACTCGTACACGGACACC 
GGGTTCCTAGCTGAGTACCTCTCCTACGACTCCAACGACCCGTGCCCAGGGATGTTCATGTGCAAGACTGGACGGTGCATCCGAAA 
GGAACTGCGCTGCGACGGCTGGGCAGACTGCCCGGATTATAGTGATGAGCGTTACTGCCGATGCAATGCCACCCACCAGTTCACGT 
GC AAAAAC C AG TT C TGC AAGC C C C T C T TC TGGGTC TG TGAC AGT GT C AAC GAC TGTGGGGAC GGAAG TGAC G AGGAGGG C TGC AGC 
TGTC C TGC TGGGAGTTTC AAGTGTTCC AATGGGAAGTGTC TC CC TC AGAGC C AGAAGTGTAATGGGAAGGACAACTGTGGAGATGG 
GTCTGACGAGGCTTCATGTGACAGCGTGAATGTCGTCTCTTGCACCAAATATACCTACCGCTGCCAAAATGGCCTCTGTCTGAGCA 
AGGGCAACCCTGAGTGTGATGGGAAGACGGACTGTAGCGATGGCTCCGATGAGAAAAACTGTGACTGTGGGCTGCGATCCTTTACC 
AAACAGGCTCGCGTGGTTGGTGGCACGAATGCGGACGAGGGCGAGTGGCCCTGGCAGGTGAGCCTCCACGCCCTGGGCCAGGGCCA 
CTTGTGTGGGGCCTCGCTCATCTCTCCTGACTGGCTGGTCTCTGCAGCTCATTGCTTTCAGGATGACAAAAATTTCAAGTACTCAG 
ACTACACGATGTGGACGGCCTTCCTGGGTCTGCTGGACCAGAGCAAGCGCAGTGCCTCTGGGGTGCAGGAGCTGAAGCTCAAACGT 
ATCATCACCCACCCTTCCTTCAATGATTTCACCTTCGACTATGACATCGCCTTGCTGGAGCTGGAGAAGTCGGTGGAGTACAGCAC 
CGTCGTGCGCCCCATCTGCCTGCCTGATGCTACCCATGTCTTCCCTGCTGGCAAGGCCATCTGGGTCACAGGCTGGGGGCACACAA 
AAGAGGGAGGTACCGGAGCGCTGATCCTGCAGAAGGGTGAGATCCGTGTCATCAACCAGACCACCTGTGAGGACCTCATGCCGCAG 
CAGATCACCCCACGAATGATGTGTGTGGGTTTCCTCAGTGGGGGTGTGGACTCCTGCCAGGGTGACTCTGGTGGCCCCTTGTCAAG 
CGCGGAGAAAGATGGGCGAATGTTCCAGGCTGGTGTGGTGAGCTGGGGTGAAGGCTGCGCTCAGAGGAACAAGCCAGGCGTGTACA 
CAAGGCTCCCTGTAGTTCGGGACTGGATCAAAGAGCACACTGGGGTATAGCAGCATGGACAGACAGCCGACCACAAACACCCACAG 
GGATGCCCGACATGCACACCTGGATACAGGAGAGGAACACTGACGACATTTATGCTGTGGCCTCCCCCCCCCAACACAACCCAGAC 
TGTGAACTGCATCCTTAGGACTCAGAGTTCTTCCAAAGTGGGACCCCCTCAAGAGTTGGAGAGAGAACTTGCGTGCTAGCGGCCCA 
GCCTGGGGGCAAGGGTTTGATGGCAGCCTTCCCCCTCTAGCCCTGAGCTGGGTGAAGATGATGCTGTCCCGGAGAGCTGCTTCCAA 
CTGTCATTGAGCTCCCGGGAGCCCTATGGGAGGAGGGGCTCAGGGTCACTCTTTTCAGGAAGCGCCAGCCCTAGGAACCCCAGAAA 
AGAGTGGTACCTAAGGCTGAAATTGTTTTGCTGTTGCCAGGGGTGGGTATTTGAGAGTAAAACATTTTATTTCTTTTTAAAAAAAA 
AAAAAAAAAA (SEQ ID N0:1) 

MGSmGRKAGGGSQDFGAGLKYNSRLENMHGFEEGVEFLPAMNA 

KKVEKRGPRRWWLVAVLFSFLLL SLMAGLLVWHFHYRWRVQKVFNGHLRI TNEIFL 

DAYENSTSTEFISLASQVKEALKLLYNEVPVLGPYHKKSAVTAFSEGSVIAYYWSEFS 

IPPHLAEEVDRAMAVERWTLPPRARALKSFVLTSWAFPIDPRMLQRTQDNSCSFAL 

HAHGAAVTRFTTPGFPNSPYPAHARCQWVLRGDADSVLSLTFRSFDVAPCDEHGSDLV 

T VYD S L S PME PHAWRLCGTFS PS YNLT FL S SQNVFL VTL I TNTDRRHPGF E ATFFQL 

PKMSSCGGFLSDTQGTFSSPYYPGHYPPNINCTWNIKVPNNR1WKTOFKLFYLTO 

P VG S C TKD YVE ING EK YCGERS QFWSSNS SKI T VHFHSDHS YTDTGFL AS YL S YDSN 

DPC PGMFMCKTGRC IRKELRCDGWADC PDYSDERYCRCNATHQFTCKNQFCKPLFWVC 

DSVWDCGDGSDEEGCSCPAGSFKCSNGKCLPQSQKCMGKDNCGDGSDEASCDSVNWS 

CTKYTYRCQNGLCLSKGNPECDGKTDCSDGSDEKNCDCGLRSFTKQARWGGTNADEG 

EWPWQVSLHALGQGHLCGASLISPDWLVSAAHCFQDDKNFKYSDYTMWTAFLGLLDQS 

KRSASGVQELKLKRIITHPSFNDFTFDYDIALLELEKSVEYSTWRPICLPDATHVFP 

AGKAIWVTGWGHTKEGGTGALILQKGEIRVINQTTCEDLMPQQITPRMMCVGFLSGGV 

DSCQGDSGGPLSSAEKDGRMFQAGWSWGEGCAQRNKPGVYTRLPWRDWIKEHTGV 

(SEQ ID NO: 2) 
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underlined = deleted in targeting construct 

[ ] = sequence flanking Neo insert in targeting construct 



CATGGTAGACGGCTGCCCGGAGGGACCACGCGTCTGAGACCGGCGATCGGACCGCCAAAA 
CCATGGGTAGCAATCGGGGCCGCAAGGCCGGAGGGGGCTCTCAGGACTTCGGCGCGGGAC 
TCAAGTACAACTCCCGGCTAGAGAACATGAATGGCTTTGAGGAGGGTGTGGAGTTCCTGC 
CTGCGAACAATGCCAAGAAAGTGGAGAAGCGAGGCCCCAGGCGCTGGGTGGTGCTGGTGG 
CAGTGCTGTTCAGCTTCCTCTTGCTCTCCCTCATGGCTGGCTTGCTGGTGTGGCACTTCC 
ATTATCGGAATGTGCGGGTTCAAAAAGTCTTCAATGGCCATCTGAGGATCACAAATGAGA 
TCTTTCTGGATGCGTATGAGAACTCCACCTCCACAGAGTTTATCAGCCTGGCCAGCCAGG 
TGAAGGAGGCGCTGAAGCTGCTGTACAATGAAGTCCCTGTCCTGGGTCCCTACCACAAGA 
AGTCGGCTGTAACTGCCTTCAGTGAGGGCAGTGTCATCGCCTACTACTGGTCAGAGTTCA 
GCATCCCCCCACACCTGGCAGAAGAGGTTGATCGCGCCATGGCTGTGGAGCGAGTTGTAA 
CATTGCCACCCCGAGCACGGGCACTGAAATCCTTCGTGCTAACATCTGTGGTGGCCTTCC 
CCATTGACCCCAGAATGCTGCAGAGGACTCAGGACAACAGCTGCAGTTTTGCCCTGCATG 
CCCATGGTGCAGCAGTGACACGCTTCACTACCCCTGGCTTCCCCAACAGTCCCTACCCGG 
CGCATGCCCGCTGCCAGTGGGTCCTGCGGGGGGACGCCGACTCTGTGCTGAGCCTCACCT 
TCCGAAGCTTTGATGTCGCTCCCTGTGATGAGCATGGCAGTGACCTGGTCACCGTGTATG 
ATAGCCTGAGCGCCATGGAACCCCACGCTGTGGTGCGGCTGTGTGGCACCTTCTCACCCT 
CCTACAACCTGACTTTCCTCTCCTCCCAGAACGTCTTCCTTGTCACGCTGATAACCAATA 
CTGACCGGCGACATCCTGGCTTTGAGGCCACTTTCTTCCAGCTGCCCAAGATGAGCAGCT 
GTGGCGGCTTTTTGAGTGACACCCAAGGGACATTTAGCAGCCCCTACTATCCAGGCCACT 
ACCCGCCCAACATCAACTGCACATGGAATATCAAGGTGCCCAACAACCGGAACGTGAAGG 
TGCGCTTCAAACTCTTCTATCTGGTGGACCCCAACGTACCAGTGGGCTCCTGCACCAAGG 
ACTATGTGGAGATCAACGGGGAGAAGTACTGCGGTGAGAGGTCCCAGTTTGTGGTGAGCA 
GCAACAGCAGCAAGATTACAGTCCACTTCCATTCTGATCACTCGTACACGGACACCGGGT 
TCCTAGCTGAGTACCTCTCCTACGACTCCAACGACCCGTGCCCAGGGATGTTCATGTGCA 
AGAC TGGACGGTGC ATC CGAAAGGAAC TGC GCTGCGAC GGC TGGGC AGAC TGC C CGGATT 
ATAGTGATGAGCGTTACTGCCGATGCAATGCCACCCACCAGTTCACGTGCAAAAACCAGT 
TCTGCAAGCCCCTCTTCTGGGTCTGTGACAGTGTCAACGACTGTGGGGACGGAAGTGACG 
AGGAGGGC TGC AGC TGTC C TGC TGGGAGTTTC AAGTGTTC C AATGGG AAGTGTC TC C C TC 
AGAGCCAGAAGTGTAATGGGAAGGACAACTGTGGAGATGGGTCTGACGAGGCTTCATGTG 
AC AGCGTGAATGTC GTC TC TTGCAC C AAATATAC CTACC GCTGCC AAAATGGC CTCTGTC 
TGAGC AAGGGC AAC C C TGAGTGTGATGGGAAGACGGAC TGTAGC GATGGC TC CGATGAGA 
AAAACTGTGACTGTGGGCTGCGATCCTTTACCAAACAGGCTCGCGTGGTTGGTGGCACGA 
ATGCGGACGAGGGCGAGTGGCCCTGGCAGGTGAGCCTCCACGCCCTGGGCCAGGGCCACT 
TGTGTGGGGCCTCGCTCATCTCTCCTGACTGGCTGGTCTCTGCAGCTCATTGCTTTCAGG 
ATGACAAAAATTTCAAGTACTCAGACTACACGATGTGGACGGCCTTCCTGGGTCTGCTGG 
AC C AGAGC AAGCGC AGTGC CTCTGGGGTGC AGGAGC TGAAGC TC AAAC GTATC ATC AC CC 
ACCCTTCCTTCAATGATTTCACCTTCGACTATGACATCGCCTTGCTGGAGCTGGAGAAGT 
CGGTGGAGTACAGCACCGTCGTGCGCCCCATCTGCCTGCCTGATGCTACCCATGTCTTCC 
CTGCTGGC AAGGCC ATCTGGGTC AC AGGCTGGGGGC ACAC AAAAGAGGGAG [ GTACCGGA 
GC GC TGATCC TGC AGAAGGGTGAGATC CGTGTCATCAACC AGACC ACC TGTGAGGACC TC 
ATGCCGCAGCAGATCACCCCACGAATGATGTGTGTGGGTTTCCTCAGTGGGGGTGTGGAC 
TCCTGC ] CAGGGTGACTCTGGTGGCCCCTTGTCAAGCGCGGAGAAAG [ATGGGCGAATGT 
TCCAGGCTGGTGTGGTGAGCTGGGGTGAAGGCTGCGCTCAGAGGAACAAGCCAGGCGTGT 
ACACAAGGCTCCCTGTAGTTCGGGACTGGATCAAAGAGCACACTGGGGTATAGCAGCATG 
GACAGAC AGCCGACC ACAAACACC CAC AGGGATGCC CGAC ATGCACACCTGGATAC AGGA 
GAGGAAC ACTGAC GAC ATTTATGC TGTGGC CTC C CC C CC C C AAC AC AACCC AGAC TGTGA 
ACTGCATCCTTAGGACTCAGAGTTCTTCCAAAGTGGGACCCCCTCAAGAGTTGGAGAGAG 
AACTTGCGTGCTAGCGGCCCAGCCTGGGGGCAAGGGTTTGATGGCAGCCTTCCCCCTCTA 
GCCCTGAGCTGGGTGAAGATGATGCTGTCCCGGAGAGCTGCTTCCAACTGTCATTGAGCT 
CCCGGGAGCCCTATGGGAGGAGGGGCTCAGGGTCACTCTTTTCAGGAAGCGCCAGCCCTA 
GGAACCCCAGAAAAGAGTGGTACCTAAGGCTGAAAT] TGTTTTGCTGTTGCCAGGGGTGG 
GTATTTGAGAGTAAAACATTTTATTTCTTTTTAAAAAAAAAAAAAAAAAA 
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Gene Sequence Structure 

* 



2466 bp 



Sequence Deleted 



2505 bp 



Size of full-length 
cDNA: 3106 bp 




Targeting Vector* 
(genomic sequence) 

Construct Number: 2035 



Arm Length- 
s': 3.8 kb 
3': 1 kb 



S* arm 



Neo 
Cassette 



3 s arm 




Targeting Vector 

• Endogenous Locus 

* Not drawn to scale 



5 1 >TTCCCCATTGAGACTGGCTTA 
CCCCGGAAGCTGCCTGCCTCAGTC 
TCCCGCTTCCTGTCTCCCCAGGTA 
CCGGAGCGCTGATCCTGCAGAAGG 
GTGAGATC CGTGTC ATC AACC AGA 
C C AC C TGTGAGG AC C TC ATGC CGC 
AGCAGATCACCCCACGAATGATGT 
GTGTGGGTTTCC TCAGTGGGGGTG 
TGGACTCCTGC<3 ■ 
(SEQ ID NO: 3) 



5 ■ >ATGGGCGAATGTTCCAGGCTG 
GTGTGGTGGGCTGGGGTGAAGGCT 
GCGCTCAGAGGAACAAGCCAGGCG 
TGTACACAAGGCTCCCTGTAGTTC 
GGGACTGGATCAAAGAGCACACTG 
GGGTATAGCAGCATGGACAGACAG 
CCGACCACAAACACCCACAGGGAT 
GC CC GAC ATGC AC AC C TGGATAC A 
GGAGAGGGACAO ■ 
(SEQ ID NO: 4) 
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